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Title: System and Method For Constructing and Personalizing a Universal 
Information Classifier 

Cross-Reference to Related Applications 

5 This application claims the benefit of U.S. Provisional Application 

Serial No. 60/268,198, entitled SYSTEM AND METHOD FOR PERSONALIZING AN 
INFORMATION CLASSIFIER and filed on February 12, 2001 . 

Technical Field 

1 o The present invention relates generally to information classification systems and 

more particularly to personalizing an information classification system (e.g., spam filter, 
relevance filter, urgency filter). 

Background of the Invention 

1 5 As the volume and variety of information available to information recipients has 

increased, the ability to filter signal information from noise information in such an ever- 
expanding volume and variety has become increasingly important. For example, an 
information recipient may receive one hundred email messages per day, of which ten are 
urgent, seventy five are regular email messages (not urgent) and fifteen are junk mail 

20 ("spam") and thus not urgent. The ability to classify such messages, to produce, for 

example, a list ordered on urgency, can increase productivity of the information recipient 
by facilitating prioritizing and focusing on important messages. 

Conventionally, static information classifiers have been produced that facilitate 
classifying information on parameters including, but not limited to, urgency, relevance to 

25 task, likelihood of being spam and likelihood of being of interest to an information 

recipient. But static information classifiers suffer from the problem that one man's 
garbage is another man's gold. Thus, what may be spam to a first information recipient 
may be an urgent message to a second information recipient. More generally, different 
information recipients may have different information receiving goals and priorities, 

30 which pre-trained classification systems may not be able to accommodate. 

Thus there remains a need for a system and method to improve classification 
systems to facilitate accounting for personal attributes associated with the information 
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recipient {e.g., preferences, usage patterns, task at hand) and/or individuating attributes of 
the received information. 

Summary of the Invention 

5 The following presents a simplified summary of the invention in order to provide 

a basic understanding of some aspects of the invention. This summary is not an extensive 
overview of the invention. It is not intended to identify key or critical elements of the 
invention or to delineate the scope of the invention. Its sole purpose is to present some 
concepts of the invention in a simplified form as a prelude to the more detailed 

1 0 description that is presented later. 

The present invention relates to a system and method for personalizing a classifier 
to account for attributes associated with an information recipient and/or individuating 
attributes of received information. The classifier to be personalized may be a pre-trained 
classifier that was constructed using generalized training sets of data. Such training sets 

1 5 may have been collected from a set of homogenous and/or heterogeneous users 

depending on initial operation desired in the classifier. Such default or "universal" 
classifiers can be trained to analyze a subset of a larger number of potentially 
discriminatory features, where the subset of features have the ability to discriminate 
among categories in a relatively universal manner for different individuals. Thus, 

20 classifiers can be based on fewer or more abstract features during training and operation 

for creating a default filter that is employable by a variety of users, and that can be 
utilized in a system that allows individuals to build more specialized filters through 
implicit or explicit training. For example, although a pre-trained, default classifier can be 
constructed using detailed findings about items being classified, such as multiword 

25 phrases and/or single words, more generalized classifiers, designed to be employed by a 

variety of information recipients may be pre-trained without such specific details. Thus, 
such detailed features as particular phrases and/or single words remain to be employed in 
personalizing the pre-trained classifiers. 

Such generalized pre-trained classifiers may be trained by considering more 

30 universal, or higher level features including, but not limited to, header structure, indicia 

of junk mail (eg., percentage non-alphanumeric characters, capitalization patterns), 
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relationships in an organization chart, length of messages, times, dates and other 
linguistic features like tense usage, presence and number of questions. Such features may 
typically be more generalizable in their influence on classifier behavior, and thus less 
sensitive to changing users than more specific features, whose relevance and influence 

5 may be more dependent on nuances of different users. When an information recipient 

receives a personalizable pre-trained classifier, adapting data maybe employed in 
personalizing the pre-trained classifier. Such adapting data may include explicit data 
provided to the information recipient for the purpose of training the personalizable 
classifier. For example a set of two hundred sample messages for the information 

1 0 recipient to classify may be provided, with the classifications performed by the 

information recipient being employed to modify the generalized pre-trained classifier 
and/or to produce a new, personalized classifier. The adapting data may additionally 
and/or alternatively include data gathered by observing user behavior in working with 
received information. By way of illustration, the classifier being trained may receive a 

1 5 message, and then may take actions including, but not limited to, immediately looking at 

the message and responding to it, immediately looking at the message and archiving it, 
immediately looking at the message and discarding it, and letting it sit in an in-box for a 
period of time. Such actions, and other data gathered by observing the data (collectively 
the "action data"), may be compared to a classification prediction made by the classifier 

20 with the relationship between the observed action and the prediction being employed to 

update the classifier being personalized. 

The adapting data may be employed to personalize a classifier in manners 
including, but not limited to, updating the pre-trained classifier and/or producing a new, 
separate classifier. When information to be processed by the classifier being 

25 personalized is received, such information may be processed by the original pre-trained 

classifier, the personalized classifier being constructed, and/or a combination of the pre- 
trained classifier and the classifier being personalized. When combined processing is 
undertaken, the weight to be associated with classifying processing performed by the pre- 
trained classifier and/or the classifier being personalized may be adjusted based on 

30 factors including, but not limited to, the stage to which the personalization of the 

classifier has progressed and the breadth of classifying features encountered in the 
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adapting data. By way of illustration, a threshold amount of adapting data may be 
considered necessary to provide a confidence level in classifications made by the 
personalized classifier. Thus, the weight accorded to classifications made by the 
personalized classifier will be increased as the amount and/or quality of adapting data 
5 employed in personalizing the classifier increases. After the threshold amount of 

adapting data has been employed in personalizing the classifier, the classification of 
subsequently received information may be performed by the personalized classifier, with 
the original classifier taking a substantially insignificant role. 

The classifiers can include, but are not limited to classifiers constructed using 

1 0 techniques known in the art like support vector methodologies, Bayesian processing (e.g., 

naive Bayesian, sophisticated Bayesian), similarity analysis (e.g., employing dot product 
and/or cosine function processing) and decision trees. Such classifiers may classify 
information on different levels. By way of illustration, a first classifier may be a 
document classifier, while a second classifier may be a document classifier employing 

1 5 text and feature analysis. 

The personalization process may progress in a desired manner, but there can be 
situations where the personalization process may not progress in a desired manner. Thus, 
the present invention provides a method, via the inclusion of temporal data associated 
with updates to the classifier being personalized, to back out of the personalization, 

20 returning the classifier to an earlier state (e.g., its original state). Such temporal data also 

facilitates personalizing a filter for different periods of time. For example, during a first 
period of time, an information recipient may have been performing a first type of 
processing and receiving a first type of information, that lead to a first personalization of 
the filter but during a second period of time, the information recipient may have been 

25 performing a second type of processing and receiving a second type of information that 

lead to a second type of personalization. Temporal data associated with such time 
sensitive personalization can include, but is not limited to, fields that hold information 
concerning the creation date of an item, fields that hold information concerning the 
receipt data of an item, fields that hold information concerning a time before which an 

30 item is or is not to be considered and fields that hold information concerning a time after 

which an item is or is not to be considered. 
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An information recipient employing a classifier that is personalizable may desire 
information concerning the personalization process, and may further desire to have some 
input into the personalization process. Thus, the present invention provides a user 
interface operable to perform actions including, but not limited to, outputting information 
5 concerning the degree to which the personalization process has progressed (e.g., 75% 

personalized), inputting information concerning the desired size of the adapting data set 
to be employed in determining the relative weighting of classifications from the pre- 
trained classifier and the classifier being personalized (e.g., sliding control for adapting 
data set size between 100 and 1000 messages), inputting information concerning the 

1 0 period of time for which personalizing adapting data should be employed (e.g., calendar 
control for specifying training times to include/exclude) and inputting information 
concerning a point in time to which the information recipient would like to reset the 
classifier (e.g., sliding control employed to remove 50 to 5000 messages from adapting 
data set to force retraining). 

1 5 As personalization progresses, information more recently employed to personalize 

the classifier may be more relevant than information less recently employed to 
personalize the classifier. Thus, the present invention provides for aging out information, 
and/or the influence of such information, from the classifier. By way of illustration, if 
five hundred messages are required to be observed before results from a personalized 

20 classifier are considered valid, then once five hundred messages have been received, 

recently received messages may cause messages received at an earlier point in time to be 
removed from the adapting data set. Such "aging out" may be performed by methods 
including, but not limited to, physical removal and removal by weighting factor. Physical 
removal may involve deleting a record from a data store associated with the classifier 

25 while removal by weighting factor may involve applying a formula to a data record, 

where the relevance of the data record is modeled by a weighting factor, with data 
records that are older receiving reduced weighting factors, potentially down to a 
weighting factor that substantially removes the data record from a classification decision. 
The weighting factor can vary based, at least in part, on the age of an item being 

30 considered. For example, as a document and/or set of documents age, the weighting 

factor applied to such documents may decrease and in one example of the present 
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invention, may decrease to a point (e.g., 0%), where the document and/or set of 
documents are no longer considered. However, a user may desire to override such aging, 
and thus, in one example of the present invention, the user may configure the present 
invention to prevent the "aging out by reduced weighting factor" of a document and/or 
5 set of documents. 

One aspect of the present invention provides a system for personalizing an 
information classifier. The system includes a first classifier that is pre-trained with 
training data and that is operable to produce a first measure related to a message being 
classified as having one or more characteristics. The characteristics may be, for example, 

1 0 a probability that a message is of a known type, a message priority, an urgency score, a 

computed expected urgency in an email prioritization system and the like. The system 
also includes a second classifier that is trained with adapting data and that is operable to 
produce a second measure related to the message being classified as having one or more 
characteristics. The system further includes a weighting component that is adapted to 

1 5 assign a first weight to the first measure and a second weight to the second measure so 

that a combining component can combine the first measure and the second measure to 
produce a third measure associated with the message being classified as having one or 
more characteristics. The combining component can perform the combination of the 
measures based, at least in part, on the first weight and the second weight. The system 

20 further includes an aging component adapted to modify the relevance of one or more 

messages based, at least in part, on time-based information associated with a message and 
an adapting component operable to modify the second classifier so that the second 
classifier becomes more personalized than the first classifier. The system further 
includes a user interface that is operable to display information concerning the 

25 personalization of the second classifier and to accept information concerning retraining 

the second classifier. 

Another aspect of the present invention provides a method for personalizing an 
information classifying process. The method includes receiving a message to be 
classified and producing a first measure concerning the classification of the message, 

30 where the measure may be, for example, a probability that the message is of a known 

type, a message priority, an urgency score, a computed expected urgency in an email 
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prioritization system and the like. The method then includes producing a second measure 
concerning the nature or classification of the message. Once the two measures have been 
produced, the first measure is combined with the second measure to produce a third 
measure associated with the classification of the message. The third measure can be 
5 produced by applying a first weight to the first measure and a second weight to the 

second measure. Such measures can be combined, for example, by a function F = (1 - 
w)mi + w(m 2 ), where w is a weighting value that may vary over time, mi is the first 
measure and m 2 is the second measure. The use of this form of the weighting equation 
leads to outputs that have the same range as the two input measures. The method further 

10 includes updating data stores, data structures, algorithms, processes, threads and/or rules 
employed in generating the second measure. The updating may be based, for example, 
on a relationship between the first measure and the second measure with the updating 
being employed to personalize the generation of the second classifier. 

To the accomplishment of the foregoing and related ends, certain illustrative 

1 5 aspects of the invention are described herein in connection with the following description 

and the annexed drawings. These aspects are indicative, however, of but a few of the 
various ways in which the principles of the invention may be employed and the present 
invention is intended to include all such aspects and their equivalents. Other advantages 
and novel features of the invention may become apparent from the following detailed 

20 description of the invention when considered in conjunction with the drawings. 

Brief Description Of The Drawings 

Fig. 1 is a schematic block diagram illustrating a system for personalizing an 
information classifier, in accordance with an aspect of the present invention. 
25 Fig. 2 is a simulated screen shot of a user interface employed with an aspect of the 

present invention. 

Fig. 3 is a schematic block diagram illustrating an information classifier where a 
first classifying component and a second classifying component reside in the same 
component, in accordance with an aspect of the present invention. 
30 Fig. 4 is a graph illustrating the progression of the personalization of an 

information classifier, in accordance with an aspect of the present invention. 
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Fig. 5 is a flow chart illustrating one particular methodology for carrying out an 
aspect of the present invention. 

Fig. 6 is a flow chart illustrating another methodology for carrying out an aspect 
of the present invention. 

5 Fig. 7 is a schematic block diagram of an exemplary operating environment for a 

system configured in accordance with the present invention. 

Detailed Description 

The present invention is now described with reference to the drawings, where like 

1 0 reference numerals are used to refer to like elements throughout. In the following 

description, for purposes of explanation, numerous specific details are set forth in order 
to provide a thorough understanding of the present invention. It may be evident, however 
that the present invention may be practiced without these specific details. In other 
instances, well-known structures and devices are shown in block diagram form in order to 

1 5 facilitate description of the present invention. 

As used in this application, the term "component" is intended to refer to a 
computer-related entity, either hardware, a combination of hardware and software, 
software, or software in execution. For example, a component may be, but is not limited 
to being, a process running on a processor, a processor, an object, an executable, a thread 

20 of execution, a program and a computer. By way of illustration, both an application 

running on a server and the server can be components. One or more components may 
reside within a process, and/or thread of execution and a component may be localized on 
one computer and/or distributed between two or more computers. 

As used in this application, the term "message" is intended to refer to an 

25 information communication. The message may be, for example, an email message, a 

web page, a file, or other identifiable sets of information. 

It is to be appreciated that various aspects of the present invention may employ 
technologies associated with facilitating unconstrained optimization and/or minimization 
of error costs. Thus, non-linear training systems/methodologies (e.g., back propagation, 

30 Bayesian learning, linear regression, non-linear regression, or other neural networking 
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paradigms including mixture of experts, ,radial basis functions, and other statistical or 
logical inference methods) may be employed. 

Referring initially to Fig. 1, a schematic block diagram illustrates a system 100 
for personalizing an information classifier. The system 100 receives a message 1 10. The 
5 message 1 1 0 may arrive from and/or be stored in a data store 1 50 that holds adapting 

data. The message 1 10 may be, for example, an email message, a web page, and/or other 
information containing messages. A first classifier 120 processes the message 1 10 to 
produce a first probability that the message 1 1 0 can be classified in a certain way. By 
way of illustration, an arriving message 1 10 may be classified according to whether it is 

1 0 urgent or not urgent. By way of further illustration, an arriving message 1 1 0 may be 
classified according to whether the message 1 1 0 is junk mail ("spam"). While the 
previous two examples refer to binary classifications, it is to be appreciated that the first 
classifier 120 can produce measures associated with higher degree classifications (e.g., a 
message priority). The first classifier 120 can be a pre-trained classifier that was 

1 5 constructed using one or more training sets of data that were collected from one or more 

homogenous and/or heterogeneous users depending on the initial operation desired in the 
first classifier 120. Although the pre-trained classifier could have been constructed using 
fine-grained data including detailed phrases and/or single words, producing a more 
generalized classifier that can produce adequate results for a variety of information 

20 recipients may be accomplished using data that has features that have relatively universal 
discriminatory relationships for multiple people. Thus, the fine-grained data (e.g., 
detailed phrases and/or single words) may be employed in personalizing the pre-trained 
classifiers and/or in producing a new, more personalized classifier. The first classifier 
120, as a more generalized information classifier, thus may have been trained by 

25 considering higher level features including, but not limited to, header structure, indicia of 

junk mail (e.g., percentage non-alphanumeric characters, capitalization patterns), 
relationships in an organization chart, length of messages, times, dates and other 
linguistic features like tense usage, presence and number of questions. 

The first classifier 120 may process the message 1 10 using techniques including, 

30 but not limited to techniques known in the art like support vector methodologies, 

Bayesian processing (e.g., naive Bayesian, sophisticated Bayesian), similarity analysis 
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(e.g., employing dot product and/or cosine function processing) and decision trees. The 
first classifier 120 may classify information on different levels. By way of illustration, a 
first classifier may be a document classifier, while a second classifier may be a document 
classifier employing text and feature analysis. 
5 The first classifier 120 may store the message 1 10 and/or information concerning 

the message 1 10 in a data store 150 associated with the first classifying component. The 
data store may be, for example, a database, a table, an array, a list, a file, a queue and a 
data cube. Storing the message 1 1 0 and/or information concerning the message 1 1 0 
facilitates evaluating how a probability was calculated, facilitates adapting (e.g., 

10 personalizing) an information classifier and facilitates specifying periods of time that 

may be relevant to adapting an information classifier. Concerning such time periods, the 
stored information may include temporal data that can be employed in computing the 
relevance of an aging item. By way of illustration and not limitation, as an item (e.g., a 
document) ages, its relevance to the personalization of a filter may decrease. Thus, the 

1 5 data store 1 50 can include fields that hold information concerning the creation date of an 

item, fields that hold information concerning the receipt data of an item, fields that hold 
information concerning times before which an item is or is not to be considered and fields 
that hold information concerning times after which an item is or is not to be considered. 
The system 100 also includes a second classifier 130. The second classifier 130 

20 processes the message 1 1 0 to produce a second measure associated with the message 1 1 0 

being classified in a certain way. It is to be appreciated that the second classifier 130 can 
produce measures including, but not limited to, measures associated with a message 
priority, measures associated with the classification of a message as a known type, a 
measure associated with an urgency score, and a measure associated with a computed 

25 expected urgency in an email prioritization system and the like. Although the second 

classifier 130 and the first classifier 120 are illustrated as separate components, it is to be 
appreciated that the first classifier 120 and the second classifier 130 may reside in the 
same computing system and/or processing system, and may access similar and/or related 
data stores and/or structures. In one example of the present invention, where the first 

30 classifier 120 and the second classifier 1 30 are distributed between two or more 

communicating processes, the present invention may employ a data packet that is 
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transmitted between the two or more computer processes. The data packet may include 
information related to personalizing an information classifier where the information 
includes, but is not limited to, weighting data, aging data and adapting data. 

Concerning the classifications produced by the classifying components, the 
5 arriving message 1 1 0 may be classified according to its urgency. By way of further 

illustration, the arriving message 1 10 may be classified according to whether the message 
1 1 0 is junk mail ("spam"). While the previous two examples refer to binary 
classifications, it is to be appreciated that the second classifier 130 can produce measures 
associated with higher degree classifications (e.g., very urgent, somewhat urgent, not 
1 0 urgent, to be completely ignored) and other characteristics (e.g., message priority, 

urgency score, computed expected urgency in an email prioritization system). 

The second classifier 130 may process the message 1 10 using techniques 
Q including, but not limited to techniques known in the art like support vector 

IB methodologies, Bayesian processing (e.g., naive Bayesian), similarity analysis (e.g., 

jTj 1 5 employing dot product and/or cosine function processing) and decision trees. The second 

"0 classifier 1 30 may also store the message 1 1 0 and/or information concerning the message 

ji 110 (e.g., temporal data, source data) in a data store associated with the second classifier, 

j!^ The data store may be, for example, a database, a table, an array, a list, a file, a queue and 

jfj a data cube. Storing the message 1 1 0 and/or information concerning the message 1 1 0 

m 20 facilitates evaluating how a probability was calculated, facilitates adapting (e.g., 

f: personalizing) the second classifier 130 and facilitates specifying periods of time that 

maybe relevant in adapting the second classifier 130. 

In one exemplary aspect of the present invention, the second classifier 130 may be 
constructed by adapting the first classifier 120. By way of illustration, one or more data 
25 stores, data structures, processes, threads, algorithms and rules associated with the first 

classifier 120 maybe updated to reflect personalization associated with the second 
classifier 130. Alternatively, and/or additionally, one or more new data stores, data 
structures, processes, threads, algorithms and rules, may be created and associated with 
the second classifier 130. The second classifier 130, whether a stand alone information 
30 classifier or an extension and/or modification of the first classifier 120 is adapted to 

provide more personalized classification of messages. By way of illustration, a 
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programmer may be employed in a group working on a compiler project. Thus, the 
programmer may receive a higher proportion of messages dealing with compiler issues, 
and may receive a higher proportion of messages from a certain group of individuals. 
These types of messages may be deemed to be more urgent for the programmer. Both the 
5 project and the group of individuals may employ specialized and/or localized language 

and/or semantic methods that are recognizable by a classifier trained with such fine- 
grained data. Thus, a generalized classifier may benefit from training that employs the 
specialized and/or localized language and/or semantic methods. Thus, a determination of 
what messages are urgent may be based, at least in part, on the unique qualities embodied 

10 in fine-grained data associated with the compiler project. 

Subsequently, the programmer may be transferred to a different group working on 
a user interface project. Thus, the classifier that was personalized during the compiler 
project may benefit from retraining through exposure to the specialized language and/or 
semantic methods employed on the new project and by the new group of information 

1 5 generators. The second classifier 130 may thus be retrained to minimize the 

personalization achieved during the compiler project and to maximize the personalization 
achieved during the user interface project. 

To achieve such personalization, the second classifier 130 maybe presented with 
messages from a data store 150 holding adapting data and an adapting component 140 

20 may update one or more data stores, data structures, processes, threads, algorithms and 

rules, for example. Updating the one or more data stores, data structures, processes, 
threads, algorithms and rules can alter the classification probabilities produced by the 
second classifier 130, which facilitates personalizing the second classifier 130. 

The data store 150 may include explicit data provided to user of the second 

25 classifier 1 30 for the purpose of training the personalizable classifier. By way of 

illustration, the user of the second classifier 130 may receive a set of two hundred 
messages that the user may classify by hand, with the user's classifications employed to 
adapt the second classifier 130. The data store 150 may additionally and/or alternatively 
include data gathered by observing user behavior in working with received information. 

30 By way of illustration, the second classifier 1 30 may receive the message 1 1 0 and make a 
prediction concerning the classification of the message 130. Such a prediction may then 
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be compared to user actions including, but not limited to, immediately looking at the 
message and responding to it, immediately looking at the message and archiving it, 
immediately looking at the message and discarding it, and letting it sit in an in-box for a 
period of time. Such actions, and other data gathered by observing the recipient of the 
5 message 1 10 may be compared to the classification prediction made by the second 

classifier 130 with the relationship between the observed action and the prediction being 
employed to update the second classifier 1 30. By way of illustration, if the predicted 
measure matches the observed action then one or more parameters, data stores, data 
structures, processes, threads, algorithms and rules, for example, may be updated to 
1 0 reinforce the decision making process that produced the matching measure. But if the 

predicted measure does not match the observed action, then one or more parameters, data 
stores, data structures, processes, threads, algorithms and rules, for example, may be 
updated to de-emphasize the decision making process that produced the non-matching 
measure. 

1 5 The system 1 00 includes a weighting component 1 70 that can accept a first 

measure produced by the first classifier 120 and assign a first weight to that first measure. 
Similarly, the weighting component 170 can accept a second measure produced by the 
second classifier 130 and assign a second weight to that second measure. A combining 
component 160 can take the first measure produced by the first classifier 120, the first 

20 weight assigned to the first measure, the second measure produced by the second 

classifier 130 and the second weight assigned to the second measure and produce an 
integrated measure 190 based, at least in part, on the two measures and the two weights. 
In one exemplary aspect of the present invention, the sum of the first and second weights 
will equal one. Thus, an example equation employed to produce the integrated measure 

25 1 90 may resemble: 

CP 190 = (pl*w2) + (p2*(l-w2)) 

Where CPi 90 is the integrated measure, pi is a first measure produced by the first 
classifier 120, wl is the first weight assigned to the first measure, p2 is the second 
measure produced by the second classifier 130 and w2 is the second weight assigned to 
30 the second measure. In one example of the present invention, the two measures being 

combined may be, for example, a score produced by a default filter and a score produced 
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by a personalized filter. Thus, the first measure may be an urgency score computed by 
using a default filter and the second measure may be an urgency score computed by using 
a personalized filter. In one example of the present invention, the integrated measure 
CP ]9o maybe normalized to a range substantially similar to the range(s) associated with 
5 the measures being integrated. 

The weights may be determined based, for example, on the degree to which the 
second classifier 130 has been personalized. By way of illustration, at the point in time 
when the second classifier 130 receives its first adapting data message, there may be a 
very low confidence that the measure produced by the second classifier 130 is valid. 

1 0 Thus, the second weight associated with the measure produced by the second classifier 

1 30 may be very low while the first weight associated with the measure produced by the 
first classifier 120 may be very high, and thus the integrated measure 190 will depend, to 
a much greater degree, on the measure produced by the first classifier 120. By way of 
further illustration, at a later point in time, when the second classifier 130 has received 

1 5 half of the amount of messages deemed necessary to produce a valid measure, there may 

still be a low confidence in the measure produced by the second classifier, but that 
measure may have some validity. Thus, the second weight associated with the measure 
produced by the second classifier 130 may be increased slightly while the first weight 
associated with the measure produced by the first classifier 120 may be decreased 

20 slightly, leaving the integrated measure 1 90 still substantially dominated by the measure 

produced by the first classifier 120. By way of still further illustration, at a still later 
point in time, when the second classifier 130 has received more than the amount of 
messages deemed necessary to produce a valid measure, there may be a high confidence 
in the measure produced by the second classifier 1 30. Thus, the second weight associated 

25 with the measure produced by the second classifier 1 30 may be increased to a point 

where it substantially dominates the integrated measure 190 produced by the combining 
component 160 and the first weight associated with the measure produced by the first 
classifier 120 may be decreased to a point where it plays substantially no part in the 
integrated measure 190 produced by the combining component 160. Thus, while the 

30 second classifier 1 30 is being personalized, the measure associated with classifying the 

message 1 10 may be influenced to varying degrees by the first classifier 120, that was 
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pre-trained with general, coarse-grained information {e.g., information with features that 
have relatively universal discriminatory relationships for multiple people), and by the 
second classifier 130, that is being trained with more specific, finer-grained information. 
The system 100 also includes an aging component 1 80 that can be employed to 
5 affect the relevance of data employed in calculating measures by the first classifier 120 

and/or the second classifier 130. In one exemplary aspect of the present invention, the 
aging component 1 80 may achieve such relevance altering effects by manipulating one or 
more weights assigned and/or employed by the weighting component 1 70. It is to be 
appreciated that the aging component 1 80 may also achieve such relevance altering 

1 0 effects by manipulating items including, but not limited to, one or more data stores, data 

structures, processes, threads, algorithms and rules associated with the first classifier 120 
and/or the second classifier 1 30. By way of illustration, the aging component 1 80 may 
examine a weight assigned by the weighting component 170 and determine that the 
information employed to produce the generated weight is from a time period that 

1 5 indicates that its relevance should be decreased as compared to data from a more relevant 

time period. In a simple example, older data may be replaced by newer data. In a more 
complicated example, referring to the programmer who first worked on a compiler 
project and then worked on a user interface project, the relevance of messages received 
since the job change may be more relevant to training the second classifier 130 and/or to 

20 producing the integrated measure 1 90, and thus the weight associated with the measure 

produced by the second classifier 130 maybe updated, and the adapting component 140 
may vary its updating processes based on the relevance of the message. But if the 
programmer subsequently returns to the compiler project, then the relevant time period 
may similarly be returned to personalization achieved during the programmer's first 

25 iteration on the compiler project. 

The personalization process applied to the second classifier 130 may progress in a 
desired manner, but there can be situations where the personalization process may not 
progress in a desired manner. Thus, the present invention provides, in one example of the 
present invention, via the inclusion of temporal data associated with updates to the 

30 classifier being personalized, for rescinding the personalization. The result of such 

rescinding is that the second classifier 130 and/or one or more data stores, data structures, 
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processes, threads, algorithms and/or rules associated with the second classifier 130 are 
returned to an earlier state (e.g., its original state). Such temporal data also facilitates 
producing an information classifier that is personalized based on different periods of 
time. For example, a user of the second classifier 130 may experience a different 
5 environment and thus be exposed to different types of messages that have different 

classifications during different periods of time. By way of illustration, an attorney may 
receive primarily law-related messages at work, but may receive primarily personal 
messages at home. Training performed during the different periods of time may have 
different affects on the second classifier 130. Thus, the attorney may desire different 
10 information classifying based on the different periods of time during which information is 
received. By way of further illustration, a contract nurse may be employed in a first 
hospital for a first period of three months, and be exposed to messages associated with 
O that hospital during that time, while the contract nurse may be employed in a second 

m hospital for a second period of six months and be exposed to messages associated with 

ifi 1 5 the second hospital during that time. Thus, training performed during the two periods of 

*S time may affect the second classifier 130 in different ways. More generally, during a first 

y, period of time, an information recipient may be exposed to a first environment and 

:L ; receive a first type of information that leads to a first personalization of the information 

ffi classifier. But during one or more second periods of time the information recipient may 

j-J 20 be exposed to one or more second environments and receive one or more second types of 

*fj information that lead to one or more second types of personalization. The inclusion of 

temporal data facilitates specifying periods of time for which the user of the second 
classifier 130 desires personalization that was performed to be employed in calculating a 
measure. 

25 Thus, by facilitating training an information classifier to produce information 

classifications based on finer-grained data associated with attributes more personal to the 
user of the information classifier, accuracy improvements over conventional information 
classification systems are achieved. Further, by facilitating retraining, and by facilitating 
specifying relevant periods of time, the present invention provides flexibility 

30 improvements over conventional systems. 
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Referring now to Fig 2, a simulated screen shot 200 illustrates user interface 
elements that may be employed to support various aspects of the present invention. Such 
user interface elements may be employed because an information recipient employing a 
personalizable information classifier may desire information concerning the 

5 personalization process, and may further desire to have some input into the 

personalization process. For example, a user may desire to know to what degree the 
information classifier has been personalized, how many messages are required to 
personalize the information classifier and what periods of time are being employed in 
calculating probabilities. Thus, the screen shot 200 illustrates a user interface graph 

1 0 element 2 1 0 that can be employed to output information concerning the degree to which 
the personalization process has progressed (e.g., 75% personalized). While a line graph 
is illustrated, it is to be appreciated that other graphs (e.g., bar graphs, pie charts) and 
other display elements may be employed in the present invention. 

The screen shot 200 further illustrates a user interface slider element 220 that can 

1 5 be employed to display information concerning the size of the adapting data set to be 
employed in determining the relative weighting of classifications from the pre-trained 
classifier and the classifier being personalized (e.g., sliding control for adapting data set 
size between 100 and 1000 messages) and which may alternatively and/or additionally be 
employed to input information employed in controlling the size of the adapting data set to 

20 be employed in determining the relative weightings. For example, the slider element 220 

may be positioned to indicate that the adapting data set size is currently three hundred 
and fifty messages and then a user of the information classifier being personalized may 
reposition the slider to indicate that the data set size should be increased to five hundred 
messages. 

25 The screen shot 200 further illustrates a user interface sliding element 230 that 

may be employed to display information concerning how many messages are available to 
be removed to force retraining and for inputting information concerning how many 
messages are to be removed and/or de-emphasized to force retraining of the information 
classifier. By way of illustration, the slider element 230 may be positioned to indicate 

30 that one hundred messages may be removed before a confidence associated with a 

measure will be effected. A user of the personalized classifier may then reposition the 
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slider to indicate that five hundred messages should be removed, which may force the 
classifier to begin retraining, and which may affect the confidence associated with a 
measure produced by the information classifier being retrained. Although user interface 
elements 220 and 230 are illustrated as sliders, it is to be appreciated that other user 
5 interface elements (e.g., dials, buttons, text input areas, speech input/output elements) 

may be employed with the present invention. The inclusion of user interface elements 
220 and 230 provide for ease of use improvements over conventional systems. 

The screen shot 200 also illustrates a user interface calendar element 240 that can 
be employed to display information concerning the period of time for which 
1 0 personalization is being employed in computing measures. The calendar element 240 can 
also be employed to input data indicating one or more periods of time for which 
personalization should be employed in computing measures. For example, a user of the 
D calendar element 240 may be able to specify that personalization occurring during a 

(33 certain three week period of time is to be emphasized while personalization occurring 

la 1 5 during a certain two week period of time is to be de-emphasized. Thus, improvements in 

flexibility over conventional systems are achieved. 

As one application of the user interface elements illustrated in screen shot 200, 
™ consider that as personalization progresses, information more recently employed to 

ffj personalize the classifier may be more relevant than information less recently employed 

m 20 to personalize the classifier. Thus, the present invention provides for aging out 
y information, and/or the influence of such information, from the classifier. By way of 

illustration, if five hundred messages are required to be observed before results from a 
personalized classifier are considered valid, then once five hundred messages have been 
received, recently received messages may cause messages received at an earlier point in 
25 time to be removed from the adapting data set. Such "aging out" may be performed by 

methods including, but not limited to, physical removal and removal by weighting factor. 
Physical removal may involve deleting a record from a data store associated with the 
classifier while removal by weighting factor may involve applying a formula to a data 
record, where the relevance of the data record is modeled by a weighting factor, with data 
30 records that are older receiving reduced weighting factors, potentially down to a 

weighting factor that substantially removes the data record from a classification decision. 
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The user interface elements included on screen shot 200 facilitate understanding the 
degree to which such personalization has progressed, and provide mechanisms for 
controlling the rate at which personalization will occur (by controlling the size of the 
required adapting data set before confidence is achieved), the points in time at which 

5 retraining will occur (by rescinding the affect of messages) and the periods of time for 

which training will be relevant (by specifying emphasize/de-emphasize time periods). 
Thus, improvements in flexibility, ease of use, and accuracy over conventional systems 
are achieved. Such temporal based processing is facilitated by storing information 
concerning the age of an item. Furthermore, information including, but not limited to, the 

1 0 creation date of an item, the receipt data of an object, dates before which an item is or is 

not to be considered and dates after which an item is or is not to be considered can be 
stored to facilitate such temporal processing. 

Referring now to Fig. 3, a system 300 for personalizing an information classifier 
is illustrated. The system 300 includes an information classifier 320 that includes both a 

1 5 first classifying component 320A and a second classifying component 320B residing in 

the same classifier 320. The system 300 receives a message 310 that may come from a 
data store 360 holding adapting data and/or the message 3 1 0 may come from an external 
source and be written to the data store 360. The information classifier 320 produces a 
measure 370 that the message 310 can be classified in a certain way. By way of 

20 illustration, the information classifier 320 may produce a measure related to a message 

being relevant to a task in which an information recipient is currently engaged, which 
may be employed to determine whether to interrupt the recipient with the message, for 
example. By way of further illustration, the measure 370 can be employed in decisions 
including, but not limited to, how to route information, when to send information, when 

25 to choose a modality for a communication, when to backup information, when to encrypt 

information, and the like. 

The information classifier 320 may initially function as a generalized information 
classifier and may, over time, be adapted to function as a more personalized information 
classifier, that takes into account the preferences, habits and unique characteristics of an 

30 information recipient and characteristics associated with information sent to an 

information recipient. Thus, the information classifier 320 may rely on previous 
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knowledge employed to train the information classifier 320 to function as a general 
classifier and may also rely on subsequent knowledge employed to train the information 
classifier 320 to function as a personalized information classifier. A division between the 
previous knowledge and the subsequent knowledge may be achieved, for example, by 
5 having the first classifying component 320A process the previous knowledge and by 
having the second classifying component 320B process the subsequent knowledge. 
Although the processing may be divided between the first classifying component 320A 
and the second classifying component 320B, it is to be appreciated that both classifying 
components may access similar and/or identical data stores, data structures, processes, 
1 0 threads, algorithms and rules, for example, to process the previous and subsequent 
knowledge. It is to be further appreciated that the previous knowledge and the 
subsequent knowledge may be stored in similar and/or identical data stores, data 
structures, processes, threads, algorithms and rules, for example. It is to be even further 
appreciated that although two classifying components 320A and 320B are illustrated, that 
1 5 a greater and/or lesser number classifying components may be employed in accordance 

with the present invention. 

Processing performed by the information classifier 320 may be affected by inputs 
from a weighting component 330, an aging component 340 and an adapting component 
350. By way of illustration, the weighting component 330 may determine that processing 
20 performed by the first classifying component 320A is more relevant to a particular 
information classification than processing performed by the second classifying 
component 320B, and thus may more heavily weight the processing performed by the 
first classifying component 320A so that the measure 370 is more heavily influenced by 
the first classifying component 320A. 
25 By way of further illustration, the aging component 340 may determine that 

information employed in determining the measure 370 has reached an age where it 
should be gradually de-emphasized in further probability processing. Thus, the aging 
component 340 may update one or more data stores, data structures, processes, threads, 
algorithms and rules associated with the information classifier 320 to alter how the 
30 measure 370 is produced. In a simple example, older data may have its relevance 
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reduced as compared to newer data. In a more complicated example, data that is unique 
may have its relevance increased as compared to repetitive data. 

By way of still further illustration, the adapting component 350 may determine 
that processing performed by the information classifier 320 can be adapted to produce a 

5 different degree of personalization. For example, a user may indicate that retraining 

should occur, and thus the adapting component 350 may update one or more data stores, 
data structures, processes, threads, algorithms and rules to rescind personalization applied 
to the information classifier 320. Conversely, a user may indicate that no more 
personalization should occur, and thus the adapting component 350 may update data 

1 0 stores, data structures, processes, threads, algorithms and rules to prevent further 

personalization. Furthermore, a user may determine that personalization that occurred 
during a first period of time should be enhanced while personalization that occurred 
during a second period of time should be de-emphasized, and thus the adapting 
component 350 may update data stores, data structures, processes, threads, algorithms 

1 5 and rules to account for the user preferences. Since the information classifier 320 may 

employ techniques known in the art like support vector methodology, naive Bayesian 
processing methodology, sophisticated Bayesian processing methodology, similarity 
analysis employing dot product and/or cosine function processing and decision tree 
processing, it is to be appreciated that updating the one or more data stores, data 

20 structures, processes, threads, algorithms and rules may vary based on the information 

classification technique(s) employed by the information classifier 320. 

Turning now to Fig. 4, a graph 400 illustrates the progression of the 
personalization of an information classifier. The x axis of the graph 400 represents the 
amount of adapting data that has been employed to personalize the information classifier. 

25 The y axis of the graph 400 represents a confidence level that can be associated with the 

measure produced by the information classifier being personalized. A curve 410 is 
illustrated rising from an initial zero level and increasing in relationship to the amount of 
data employed to personalize the information classifier. A certain amount of data K is 
represented by the vertical line 420. At the point in time when the curve 410 crosses the 

30 line 420, an acceptable confidence level associated with the measure produced by the 

information classifier may be achieved. The present invention allows a user of an 
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information classifier to manipulate K so that line 420 may be moved to the left or the 
right on the x axis of graph 400. Thus, the present invention provides flexibility gains 
over conventional systems. By way of illustration, a first user of an information classifier 
may determine that the generalized information classifier provides acceptable results, and 
5 thus a low value for K may be established, reducing the training time for the information 

classifier being trained. But a second user of an information classifier may determine 
that the generalized information classifier requires extensive training before acceptable 
personalized results are achieved, and thus a high value for K may be established. 

It is to be appreciated that although graph 410 illustrates a relationship between 
1 0 the size of the adapting data set, that other relationships may also be employed to affect 

the personalizing of an information classifier. For example, in another relationship, the x 
axis of the graph 400 may represent the coverage of experienced adapting data, where 
coverage relates to how many of a pre-determined set of features have been experienced 
in the adapting data set. The coverage may relate, for example, to the presence or 
1 5 absence of one or more pre-determined linguistic features in the adapting data. By way 

of illustration, an adapting data set that is substantially homogenous may not provide a 
sufficient variety between linguistic features to partition the adapting data into sets that 
can be employed to classify a message. If substantially all the adapting data received is 
the same, it may be meaningless to distinguish between such messages, or distinctions 
20 may be made on inappropriate features. Thus, the coverage of the adapting data set may 

be low, and the confidence in probabilities associated with the adapting data set may 
similarly be low. Conversely, if an adapting data set that is rich in variety between 
linguistic, structural and/or semantic features is presented to an information classifier 
being personalized, then the coverage may be high and the confidence in a measure 
25 produced by an information classifier exposed to such high coverage data set may be 
correspondingly higher. 

In view of the exemplary systems shown and described above, methodologies, 
which may be implemented in accordance with the present invention will be better 
appreciated with reference to the flow diagrams of Figs. 5 and 6. While for purposes of 
30 simplicity of explanation, the illustrated methodologies are shown and described as a 

series of blocks, it is to be understood and appreciated that the present invention is not 
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limited by the order of the blocks, as some blocks may, in accordance with the present 
invention, occur in different orders and/or concurrently with other blocks from that 
shown and described herein. Moreover, not all illustrated blocks may be required to 
implement a methodology in accordance with the present invention. Further, additional 
5 and/or alternative methodologies may employ additional blocks, not illustrated herein. 

The invention may be described in the general context of computer-executable 
instructions, such as program modules, executed by one or more components. Generally, 
program modules include routines, programs, objects, data structures, etc. that perform 
particular tasks or implement particular abstract data types. Typically the functionality of 
1 0 the program modules can be combined or distributed as desired in various embodiments. 
Furthermore, computer executable instructions operable to perform the methods 
described herein can be stored on computer readable media. 

Turning now to Fig. 5, a flow chart illustrates a method 500 for personalizing an 
information classifier. At 510, general initializations occur. Such initializations can 
1 5 include, but are not limited to, allocating memory, establishing pointers, establishing data 

communications, acquiring resources, setting variables and displaying process activity. 
At 520, a message to be classified is received. The message may arrive, for example, 
from an adapting data set, which may include both explicit messages and observed 
messages, or from an external source. At 530, a first measure associated with the 
20 message of 520 is produced. The first measure may be produced by techniques 

including, but not limited to, support vector methodology, a naive Bayesian processing 
methodology, a sophisticated Bayesian processing methodology, a similarity analysis 
employing dot product and/or cosine function processing and decision tree processing. 
The first measure may also be based, for example, on previous knowledge, where such 
25 previous knowledge may be general in nature. For example, the previous knowledge 

may concern data including, but not limited to, header structure, indicia of junk mail 
{e.g., percentage non-alphanumeric characters, capitalization patterns), relationships in an 
organization chart, length of messages, times, dates and other linguistic features like 
tense usage, presence and number of questions. 
30 At 540 a second measure associated with the message of 520 is produced. The 

second measure may be produced by techniques including, but not limited to, support 
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vector methodology, a naive Bayesian processing methodology, a sophisticated Bayesian 
processing methodology, a similarity analysis employing dot product and/or cosine 
function processing and decision tree processing. The second measure may also be 
based, for example, on subsequent knowledge, where such subsequent knowledge may be 

5 specific in nature. For example, the subsequent knowledge may concern data including, 

but not limited to detailed phrases and/or single words. The measures may be associated 
with properties including, but not limited to, the probability that a message has a certain 
classification, the priority of a message, the urgency of a message and the computed 
expected urgency of a message, for example. 

1 0 At 550 the measures of 530 and 540 may be combined to produce a combined 

measure. The combination may be affected by weights associated with each of the 
measures. By way of illustration, if a higher confidence exists for the measure of 530, 
then a higher weight may be associated with that measure while a lower weight may be 
associated with the measure of 540. But if a very high confidence exists for the measure 

1 5 of 540, then the weights may be adjusted so that substantially all of the measure of 550 is 

determined by the measure of 540 and substantially none of the measure of 550 is 
determined by the measure of 530. The measures being combined may be, for example, 
measures produced by a default filter and a personalized filter. Thus, the weights 
assigned to the measures may be employed to choose between the default filter and the 

20 personalized filter, for example. 

At 560, the message of 520, and/or information associated with message (e.g., 
time received, length of message, sender, probabilities) (hereinafter "message data") may 
be stored. At 570, one or more stored messages, and/or message data associated with the 
stored messages may be updated to reflect changes in the relevance of such stored 

25 messages and/or message data. By way of simple illustration, older messages and/or 

message data may be updated to reduce their relevance, while newer data may be updated 
to increase their relevance. Similarly, messages and/or message data associated with 
messages that increase the coverage of training data may be updated to increase their 
relevance while repetitive messages and/or message data may be updated to decrease 

30 their relevance. 
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At 580, the system(s) and/or method(s) that were employed in 540 to produce the 
second measure are adapted. Such adaptation functions to personalize the information 
classification associated with producing the second measure. The update may be based, 
for example, on the relationship between the measure of 530 and the measure of 540. 
The update may also be based, for example, on a relationship between the measure 540 
and explicit classification instructions provided by a user during supervised learning, for 
example. 

At 590 a determination is made concerning whether another message is to be 
processed. If the determination is no, then processing concludes, otherwise processing 
returns to 520. 

Turning now to Fig. 6, a flow chart illustrates another methodology 600 employed 
in personalizing an information classifier. At 610, general initializations occur. Such 
initializations can include, but are not limited to, allocating memory, establishing 
pointers, establishing data communications, acquiring resources, setting variables and 
displaying process activity. At 620, information concerning the progress of 
personalization of an information classifier is provided. For example, a graph illustrating 
the degree to which the information classifier has been personalized may displayed. The 
information concerning the progress of personalization may include information 
concerning a variable K, which is associated with the adequacy of the adapting data 
employed in personalizing the information classifier. For example, the variable K may 
be related to the size of an adapting data set and/or the coverage achieved by an adapting 
data set. 

At 630, a determination is made concerning whether the personalizer of the 
information classifier desires to adjust K. If the determination at 630 is YES, then at 640 
K can be adjusted. For example, data may be displayed concerning the current status of 
K and the effects of adjusting K, and a new value for K may be input. If the 
determination at 630 is NO, then processing proceeds to 650. At 650, a determination is 
made concerning whether changes to time phases associated with the personalization of 
the information classifier are desired. For example, a user may determine that 
personalization achieved during a first period of time is to be de-emphasized while 
personalization achieved during a second period of time is to be emphasized. If the 
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determination at 650 is YES, then at 660 processing to adjust the time phases occurs. For 
example, the user may specify that personalization achieved during a first period of time 
is to be de-emphasized while personalization achieved during a second period of time is 
to be emphasized. If the determination at 650 is NO, then processing proceeds to 670. 

5 At 670 a determination is made concerning whether an information personalizer 

desires to force retraining of an information classifier. While this application employs 
the terms "user" and "information personalizer", it is to be appreciated that such users 
and information personalizers may be human, mechanical, electronic and/or computer 
based entities. If the determination at 670 is YES, then at 680, processing associated with 

1 0 forcing retraining may be commenced. For example, K may be adjusted, one or more 
data stores, data structures, processes, threads, algorithms and/or rules maybe reset to 
previous states and one or more messages and/or message data may be deleted. Such 
deletion maybe physical (e.g., resetting bits) or logical (e.g., setting a flag, adjusting a 
weight). 

1 5 At 690 a determination is made concerning whether the current information 

classifier modification session is completed. If the determination is YES, processing 
concludes, otherwise, processing returns to 620 where another iteration can begin. 

In order to provide additional context for various aspects of the present invention, 
Fig. 7 and the following discussion are intended to provide a brief, general description of 

20 a suitable computing environment 7 1 0 in which the various aspects of the present 

invention may be implemented. While the invention has been described above in the 
general context of computer-executable instructions that may run on one or more 
computers, those skilled in the art will recognize that the invention also maybe 
implemented in combination with other program modules and/or as a combination of 

25 hardware and software. Generally, program modules include routines, programs, 

components, data structures, etc. that perform particular tasks or implement particular 
abstract data types. Moreover, those skilled in the art will appreciate that the inventive 
methods maybe practiced with other computer system configurations, including single- 
processor or multiprocessor computer systems, minicomputers, mainframe computers, as 

30 well as personal computers, hand-held computing devices, microprocessor-based or 
programmable consumer electronics, and the like, each of which maybe operatively 
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coupled to one or more associated devices. The illustrated aspects of the invention may 
also be practiced in distributed computing environments where certain tasks are 
performed by remote processing devices that are linked through a communications 
network. In a distributed computing environment, program modules may be located in 
5 both local and remote memory storage devices. 

With reference to Fig. 7, an exemplary environment 710 for implementing various 
aspects of the invention includes a computer 712, the computer 712 including a 
processing unit 7 1 4, a system memory 7 1 6 and a system bus 7 1 8 . The system bus 7 1 8 
couples system components including, but not limited to the system memory 71 6 to the 
1 0 processing unit 714. The processing unit 7 1 4 may be any of various commercially 

available processors. Dual microprocessors and other multi-processor architectures also 
can be employed as the processing unit 714. 

The system bus 718 can be any of several types of bus structure including a 
memory bus or memory controller, a peripheral bus and a local bus using any of a variety 
1 5 of commercially available bus architectures. The system memory 722 includes read only 

memory (ROM) 720 and random access memory (RAM) 722. A basic input/output 
system (BIOS), containing the basic routines that help to transfer information between 
elements within the computer 712, such as during start-up, is stored in ROM 720. 

The computer 712 further includes a hard disk drive 724, a magnetic disk drive 
20 726, (e.g., to read from or write to a removable disk 728) and an optical disk drive 730, 
(e.g., for reading a CD-ROM disk 732 or to read from or write to other optical media). 
The hard disk drive 724, magnetic disk drive 726 and optical disk drive 730 can be 
connected to the system bus 71 8 by a hard disk drive interface 734, a magnetic disk drive 
interface 736 and an optical drive interface 738, respectively. The drives and their 
25 associated computer-readable media provide nonvolatile storage of data, data structures, 

computer-executable instructions, etc. for the computer 712, including for the storage of 
broadcast programming in a suitable digital format. Although the description of 
computer-readable media above refers to a hard disk, a removable magnetic disk and a 
CD, it should be appreciated by those skilled in the art that other types of media which 
30 are readable by a computer, such as zip drives, magnetic cassettes, flash memory cards, 

digital video disks, cartridges, and the like, may also be used in the exemplary operating 
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environment, and farther that any such media may contain computer-executable 
instructions for performing the methods of the present invention. 

A number of program modules can be stored in the drives and RAM 722, 
including an operating system 740, one or more application programs 742, other program 
5 modules 744 and program data 746. It is to be appreciated that the present invention can 
be implemented with various commercially available operating systems or combinations 
of operating systems. 

A user can enter commands and information into the computer 712 through a 
keyboard 748 and a pointing device, such as a mouse 750. Other input devices (not 

1 0 shown) may include a microphone, an IR remote control, a joystick, a game pad, a 

satellite dish, a scanner, or the like. These and other input devices are often connected to 
the processing unit 714 through a serial port interface 752 that is coupled to the system 
bus 718, but may be connected by other interfaces, such as a parallel port, a game port, a 
universal serial bus ("USB"), an IR interface, etc. A monitor 754 or other type of display 

15 device is also connected to the system bus 718 via an interface, such as a video adapter 

756. In addition to the monitor, a computer typically includes other peripheral output 
devices (not shown), such as speakers, printers etc. 

The computer 712 may operate in a networked environment using logical 
connections to one or more remote computers, such as a remote computer(s) 758. The 

20 remote computers) 758 may be a workstation, a server computer, a router, a personal 

computer, microprocessor based entertainment appliance, a peer device or other common 
network node, and typically includes many or all of the elements described relative to the 
computer 712, although, for purposes of brevity, only a memory storage device 760 is 
illustrated. The logical connections depicted include a local area network (LAN) 762 

25 and a wide area network (WAN) 764. Such networking environments are commonplace 

in offices, enterprise- wide computer networks, intranets and the Internet. 

When used in a LAN networking environment, the computer 7 1 2 is connected to 
the local network 762 through a network interface or adapter 766. When used in a WAN 
networking environment, the computer 712 typically includes a modem 768, or is 

30 connected to a communications server on the LAN, or has other means for establishing 

communications over the WAN 764, such as the Internet. The modem 768, which may 
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be internal or external, is connected to the system bus 718 via the serial port interface 
752. In a networked environment, program modules depicted relative to the computer 
7 1 2, or portions thereof, may be stored in the remote memory storage device 760. It will 
be appreciated that the network connections shown are exemplary and other means of 
5 establishing a communications link between the computers may be used. 

What has been described above includes examples of the present invention. It is 
of course, not possible to describe every conceivable combination of components or 
methodologies for purposes of describing the present invention, but one of ordinary skill 
in the art may recognize that many further combinations and permutations of the present 

10 invention are possible. Accordingly, the present invention is intended to embrace all 

such alterations, modifications and variations that fall within the spirit and scope of the 
appended claims. Furthermore, to the extent that the term "includes" is used in either the 
detailed description or the claims, such term is intended to be inclusive in a manner 
similar to the term "comprising", as comprising is interpreted as a transitional word in a 

1 5 claim. 
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